1 GCAC6AG6AACAGAACACTTTCTCATGTCCAGGGTCAGATTACAAGAGCACTCAAGACTT 60

61 TACTGACGAAAACTCAGGAAATCCTCTATCACAAAGAGGTTTGGCAACTAAACTAAGACA 120 

121 TTAAAAGGAAAATACCAGATGCCACTCTGCAGGCTGCAATAACTACTACTTACTGGATAC 180 

181 ATTCAAACCCTCCAGAATCAACAGTTATCAGGTAACCAACAAGAAATGCAAGCCGTCGAC 240 



241 AATCTCACCTCTGCGCCTGGGAACACCAGTCTGTGCACCAGAGACTACAAAATCACCCAG 300 

6 NLTSAPGNTSLCTRDYKITQ 25

301 GTCCTCTTCCCACTGCTCTACACTGTCCTGTTTTTTGTTGGACTTATCACAAATGGCCTG 360 

26 VLFPLLYTVLFFVGLITNGL 45

361 GCGATGAGGATTTTCTTTCAAATCCGGAGTAAATCAAACTTTATTATTTTTCTTAAGAAC 420

46 AMRIFFQIRSKSNFIIFLKN 65

421 ACAGTCATTTCTGATCTTCTCATGATTCTGACTTTTCCATTCAAAATTCTTAGTGATGCC 480 

66 TVISDLLMILTFPFKILSDA 85

481 AAACTGGGAACAGGACCACTGAGAACTTTTGTGTGTCAAGTTACCTCCGTCATATTTTAT 540

86 KLGTGPLRTFVCQVTSVIFY 105

541 TTCACAATGTATATCAGTATTTCATTCCTGGGACTGATAACTATCGATCGCTACCAGAAG 600 

106 FTMYISISFLGLITIDRYQK 125

601 AcCACCAGGCCATTTAAAACATCCAACCCCAAAAATCTCTTGGGGGCTAAGATTCTCTCT 660 

126 TTRPFKTSNPKNLLGAKILS 145

661 GTTGTCATCTGGGCATTCATGTTCTTACTCTCTTtGCCTAACATGATTCTGACCAACAGg 720 

146 VVIWAFMFLLSLPNMILTNR 165

721 CAGCCGAGAGACAAGAATGTGaAGAAaTGCTCTTTCCTTAAATCAGAGTTCGGTCTAGTC 780

166 QPRDKNVKKCSFLKSEFGLV 185

781 TGGCATGAAATAGTAAATTACATCTGTCAAGTCATTTTCTGGATTAATTTCTTAATTGTT 840

186 WHEIVNYICQVIFWINFLIV 205
841 ATTGTATGTTATACACTCATTACAAAAGAACTGTACCGGTCATACGTAAGAACGAGGGGT 900

206 IVCYTLITKELYRSYVRTRG 225 

901 GTAGGTAAAGTCCCCAGGAAAAAGGTGAACGTCAAAGTTTTCATTATCATTGCTGTATTC 960

226 VGKVPRKKVNVKVFIIIAVF 245

961 TTTATTTGTTTTGTTCCTTTCCATTTTGCCCGAATTCCTTACACCCTGAGCCAAACCCGG 1020

246 FICFVPFHFARIPYTLSQTR 265

1021 GATGTCTTTGACTGCACtGcTGAAAATACTCTGTTCTATGTGAAAGAGAGCACTCTGTGG 1080 

266 DVFDCTAENTLFYVKESTLW 285 

1081 TTAACTTCCTTAAATGCATGCCTGGATCCGTTCATCTATTTTTTCCTTTGCAAGTCCTTC 1140

286 LTSLNACLDPFIYFFLCKSF 305 

1141 AGAAATTCCTTGATAAGTATGCTGAAGTGCCCCAATTCTGCAACATCTCTGTCCCAGGAC 1200 

306 RNSLISMLKCPNSATSLSQD 325 

1201 AATAGGAAAAAAGAACAGGATGGTGGTGAcCCAAATGAAGAGACTCCAATGTAAACAAAT 1260 

326 NRKKEQDGGDPNEETPM* 343 

1261 TMCTMGGAMTATTTCMTCTCTTTGTGTTCAGAACTCGTTAAAGCAAAGCGCTAAGT 1320 

1321 AAAAATATTAACTGACGAAGAAGCAACTAAGTTAATAATAATGACTCTAAAGAAACAGAA 1380 

1381 GATTACAAMGCMTTTTCATTTACCT1TCCAGTATGAAAAGCTATCTTAAAATATAGAA 1440 

1441 AACTAATCTAAACTGTAGCTGTATTAGCAGCAAAACAAACGACATCCAATTGTCATGCTG 1500 

1501 CATGCAAMCTACACAGMTTCATGTTTrGgCAGAGTTTTGGCAAAATGAGTAATCATAT 1560 

1561 MTATTTACTGTMTTTTTAAMTACATTATC 1620 

1621 CTMGGMGMCGATCMTTGGATATMTCTTCTTACCAAAAATGATAGTTAAAATGTAT 1680

1681 ATATATCCTAGTCCCCTMCCaMTCCTGACCTATTGGGATACTTATAAAAATTTAAGTA 1740 

1741 AGTGGGATACACAMGAATMTMCTATTMCTTTTCATTATTAGCcAAAAACCTAAGGG 1800 

FIG.1B 



1801 AmAMCTMTTGMaCTGTAmGATTGGACTTMTTTmATGTTTATTTAGAAGAT 1860 

1861 AMGATTTMGMGACCTTTACMTAAAGAGAAGAAATATCGAAGTCATTAAAATAAGGA 1920 

1921 GACTTACTTTTATGACATTCTMTACTAAAAMTATAGAMTATTTCCTTAATTCTAGAG 1980 

1981 AMCTAGTTTTACTMTTTTTTACMCTTCMTMTACCATCACTGACACTTACCTTTAT 2040 

2041 TMTTAGCTTCTAGAAMTAGCTGCTMTTAGGTTMTGMCATTTTACCTTAGTGAAAA 2100 

2101 AMaTTAATTAAATATGATTACAAAGTTGCACAGCATAACTACTGAGAGGAAAGTGATTG 2160 

2161 ATCTGTTTGTMTTACTTGTTTGTATTGGTGTGTATAAMTACAMTTTACATTAAACTC 2220 

2221 TAAAtcattaaaAAAAAAAAAAAAAAA 2247 
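
The residues printed beneath each coding line can be cross-checked against the nucleotides above them. The following is a minimal sketch, assuming the standard genetic code and a reading frame in which nucleotide 241 starts a codon; the names `CODON_TABLE` and `translate` are illustrative and do not come from the listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 0; codons with non-ACGT characters become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# Nucleotides 241-300 of the listing above and the residues printed beneath them.
line_241_300 = "AATCTCACCTCTGCGCCTGGGAACACCAGTCTGTGCACCAGAGACTACAAAATCACCCAG"
assert translate(line_241_300) == "NLTSAPGNTSLCTRDYKITQ"
```

Codons containing a character outside A, C, G and T are reported as `X` rather than guessed, so lines with unresolved characters show up directly in the output.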
1 MQAVDNLTSAP....GNTSLCTRDYKITQVLFPLLYTVLFFVGLITNGLA 46

. j . : | : | . : . | | . : : : : | | | . : : | : : | | : . | | | 

3 IQMANNFTPPSATPQGNDCDLYAHHSTARIVMPLHYSLVFIIGLVGNLLA 52 

47 MRIFFQIRSKSN.FIIFLKNTVISDLLMILTFPFKILSDAKLGTGPLRTF 95

: : : . | | . | | . : : . | | | | | : | : . . : | : |. | • • • : 
53 LVVIVQNRKKINSTTLYSTNLVISDILFTTALPTRIAYYAMGFDWRIGDA 102 

96 VCQVTSVIFYFTMYISISFLGLITIDRYQKTTRPFKTSNPKNLLGAKILS 145 

:|.:|.::||:. | ::.|:. =-111= ..:|:: l-= II : 
103 LCRITALVFYINTYAGVNFMTCLSIDRFIAVVHPLRYNKIKRIEHAKGVC 152 

146 VVIWAFMFLLSLPNMI..LTNRQPRDKNVKKCSFLKSEFGLVWHEIVNYI 193

153 IFVWILVFAQTLPLLINPMSKQEAERITCMEYPNFEETKSLPWILLGACF 202

194 CQVIFWINFLIVIVCYTLITKELYRSYVRTRGVGK..VPRKKVNVKVFII 241

.. :: ::|:::||. |. .|:| :| | :| :|. ::|| 

203 IGYVL..PLIIILICYSQICCKLFRTAKQNPLTEKSGVNKKALNTIILII 250

242 IAVFFICFVPFHFARIPYTLSQTR..DVFDCTAENTLFYVKESTLWLTSL 289

251 V.VFVLCFTPYHVAIIQHMIKKLRFSNFLECSQRHSFQISLHFTVCLMNF 299

290 NACLDPFIYFFLCKSFRNSLISMLKCPNSATSLSQDNRKKEQDGGDPNEE 339 

|.|:||||||| ||:::..::.||| . | .|:| • :. • -I 

300 NCCMDPFIYFFACKGYKRKVMRMLKRQVS.VSISSAVKSAPEENSREMTE 348

340 TPM 342 
349 TQM 351
1 GGCACGAGCCCACCCTGCGTCGGGCCTCAGTCAGCCCCCGGGGGAGGCCATGAACGCCAC 60

1 MNAT 4

61 GGGGACCCCGGTGGCCCCCGAGTCCTGCCAACAGCTGGCGGCCGGCGGGCACAGCCGGCT 120 

5 GTPVAPESCQQLAAGGHSRL 24 

121 CATTGTTCTGCACTACAACCACTCGGGCCGGCTGGCCGGGCGCGGGGGGCCGGAGGATGG 180 

25 IVLHYNHSGRLAGRGGPEDG 44

181 CGGCCTGGGGGCCCTGCGGGGGCTGTCGGTGGCCGCCAGCTGCCTGGTGGTGCTGGAGAA 240 

45 GLGALRGLSVAASCLVVLEN 64

241 CTTGCTGGTGCTGGCGGCCATCACCAGCCACATGCGGTCGCAACGCTGGGTCTACTATTG 300 

65 LLVLAAITSHMRSQRWVYYC 84 

301 CCTGGTGAACATTACGATGAGTGACCTGCTCACGGGCGCGGCCTACCTGGCCAACGTGCT 360 

85 LVNITMSDLLTGAAYLANVL 104 

361 GCTGTCGGGGGCCCGCACCTTCCGTCTGGCGCCCGCCCAGTGGtTCCTACGGAAGGGCCT 420 

105 LSGARTFRLAPAQWFLRKGL 124

421 GCTCTTCACCGCCCTGGCCGCCTCCACCTTCAGCCTGCTCTTCACTGCAGGGTTGCGCTT 480 

125 LFTALAASTFSLLFTAGLRF 144

481 TGCCACCATGGTGCGGCCGGTGGCCGAGAGCGGGGCCACCAAGACCAGCCGCGTCTACGG 540 

145 ATMVRPVAESGATKTSRVYG 164

541 CTTCATCGGCCTCTGCTGGCTGCTGGCCGCGCTGCTGGGGATGCTGCCTTTGCTGGGCTG 600 

165 FIGLCWLLAALLGMLPLLGW 184

601 GAACTGCCTGTGCGCCTTTGACCGCTGCTCCAGCCTTCTGCCCCTCTACTCCAAGCGCTA 660

185 NCLCAFDRCSSLLPLYSKRY 204

661 CATCCTCTTCTGCCTGGTGATCTTCGCCGGCGTCCTGGCCACCATCATGGGCCTCTATGG 720 

205 ILFCLVIFAGVLATIMGLYG 224 

721 GGCCATCTTCCGCCTGGTGCAGGCCAGCGGGCAGAAGGCCCCACGCCCAGCGGCCCGCCG 780 

225 AIFRLVQASGQKAPRPAARR 244 
781 CAAGGCCCGCCGCCTGCTGAAGACGGTGCTGATGATCCTGCTGGCCTTCTTGGTGTGCTG 840

245 KARRLLKTVLMILLAFLVCW 264 

841 GGGACCACTCTTCGGGCTGCTGCTGGCCGACGTCTTTGGCTCCAACCTCTGGGCCCAGGA 900 

265 GPLFGLLLADVFGSNLWAQE 284 

901 GTACCTGCGGGGCATGGACTGGATCCTGGCCCTGGCCGTCCTCAACTCGGCGGTCAACCC 960 

285 YLRGMDWILALAVLNSAVNP 304 

961 CATCATCTACTCCTTCCGCAGCAGGGAGGTGTGCAGAGCCGTGCTCAGCTTCCTCTGCTG 1020 
305 IIYSFRSREVCRAVLSFLCC 324

1021 CGGGTGTCTCCGGCTGGGCATGCGAGGGCCCGGGGACTGCCTGGCCCGGGCCGTCGAGGC 1080 
325 GCLRLGMRGPGDCLARAVEA 344

1081 TCACTCCGGAGCTTCCACCACCGACAGCTCTCTGAGGCCAAGGGACAGCTTTCGCGGCTC 1140 
345 HSGASTTDSSLRPRDSFRGS 364

1141 CCGCTCGCTCAGCTtTCGGATGCGGGAGCCCCTGTCCAGCATCTCCAGCGTGCGGAGCAT 1200 
365 RSLSFRMREPLSSISSVRSI 384

1201 CTGAAGTTGCAGTCTTGCGTGTGGATGGTGCAACCACCGGGTGCGTGCCAGGCAGGCCCT 1260
385 * 385 

1261 CCTGGGGTACAGGAAGCTGTGTGCACGCAACCTCGCCCTGTATGGGGAGCAGGGAACGGG 1320 

1321 ACAGGCCCCCATGGACTTGCCCGGTGGCCTCTCGGGGCTTCTGACGCCATATGGACTTGC 1380 

1381 CCATTGCCTATGGCTCACCCTGGACAAGGAGGCAACCACCCCACCTCCCCGTAGGAGCAG 1440 

1441 AGAGCACCCTGGTGTGGGGGCGAGTGGGTTCCCCACAACCCCGCTTCTGTGTGATTCTGG 1500

1501 GGAAGTCCCGGCCCCTCTCTGGGCCTCAGTAGGGCTCCCAGGCTGCAAGGGGTGGACTGT 1560 

1561 GGGATGCATGCCCTGGCMCATTGMGTTCGATCATGGTAAAAAAAAAAAAAAAAAAAAA 1620 

1621 AAAAAAAAAAAAAAAAA 1637 
1 MNATGTPVAPESCQQLAAGGHSRLIVLHYNHSGRLAGRGGPEDGGLGALR 50

|.:|:.|:... =11 111-1 = 1- .".|: " • 

1 MGPTSVPLVKAHRSSVSDYVNYDIIVRHYNYTGKLNISADKEN.SIKLTS 49

51 GLSVAASCLVVLENLLVLAAITSHMRSQRWVYYCLVNITMSDLLTGAAYL 100 

.: : :|:::|||::|| -| . .: =1 :|| : • I : • : I I I I • I • I I 
50 VVFILICCFIILENIFVLLTIWKTKKFHRPMYYFIGNLALSDLLAGVAYT 99 

101 ANVLLSGARTFRLAPAQWFLRKGLLFTALAASTFSLLFTAGLRFATMVRP 150 

|::|.|MIII|.| : I • I I • I I • I I I I -I I- H :: 
100 ANLLLSGATTYKLTPAQWFLREGSMFVALSASVFSLLAIAIERYITMLKM 149 

151 VAESGATKTSRVYGFIGLCWLLAALLGMLPLLGWNCLCAFDRCSSLLPLY 200 

..|. .. |:: :|: ||::. :|| 1 1 : : I I I I : : I : • • I I - I I I I 
150 KLHNGS.NNFRLFLLISACWVISLILGGLPIMGWNCISALSSCSTVLPLY 198

201 SKRYILFCLVIFAGVLATIMGLYGAIFRLVQASGQKAPRPAARRKARR.. 248

hlllll .:|- :| -h II- |:.||...:.: • • • -ll-l 
199 HKHYILFCTTVFTLLLLSIVILYCRIYSLVRTRSRRLTFRKNISKASRSS 248 

249 LLKTVLMILLAFLVCWGPLFGLLLADVFGSNLWAQEYLRGMDWILA 294

|||||:::| - 1 = - I I = I I 1 III II h • ' -I — I- 
249 ENVALLKTVIIVLSVFIACWAPLFILLLLDV.GCKVKTCDILFRAEYFLV 297

295 LAVLNSAVNPIIYSFRSREVCRAVLSFLCCGCLRLGMRGPGDCLARAVEA 344

| | | | | I : • I I I I I • : • : I = | | I • -I : I" I . 

298 LAVLNSGTNPIIYTLTNKEMRRAFIRIMSCCKCPSG..DSAGKFKRPIIA 345

345 ...HSGASTTDSSLRPRDSFRGSRSLSFRMREPLSSIS 379 

| ....:|| ..:|. |... .: :-l I 

346 GMEFSRSKSDNSSHPQKDE..GDNPETIMSSGNVNSSS 381